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Bacteria and Archaea encode clustered, regularly interspaced, short palindromic repeat (CRISPR) systems to confer 
adaptive immunity to invasive viruses and plasmids. Recent studies of CRISPR systems revealed that diverse CRISPR- 
associated (Cas) interference modules often coexist in different organisms but functions of cas genes have not been 
dissected in any of these systems. The crenarchaeon Sulfolobus islandicus encodes three distinct CRISPR interference 
modules, including a type IA system and two type 1MB systems: Cmr-ct and Cmr-(3. To study the genetic determinants 
of protospacer-adjacent motif (PAM)-dependent DNA targeting activity and mature CRISPR RNA (crRNA) production in 
this organism, mutants deleting individual genes of the type IA system or removing each of other Cas modules were 
constructed. Characterization of these mutants revealed that Cas7, Cas5, Cas6, Cas3' and Cas3" are essential for PAM- 
dependent DNA targeting activity, whereas Csa5, along with all other Cas modules, is dispensable for the targeting in the 
crenarchaeon. Cas6 is implicated as the only enzyme for pre-crRNA processing and the crRNA maturation is independent 
of the DNA targeting activity. Importantly, we show that Cas7 and Cas5 are essential for stabilizing the processing 
intermediates and mature crRNAs, respectively, and that depleting the helicase or nuclease domain of Cas3 leads to the 
accumulation of processing intermediates. This demonstrates that in addition to Cas6, other Cas proteins of an archaeal 
type IA system also contribute to crRNA processing. 



Introduction 

Clustered regularly interspaced short palindromic repeats 
(CRISPR) and CRISPR-associated (Cas) proteins constitute the 
adaptive and inheritable immune system in almost all archaea 
and more than 40% of bacteria. The antiviral nature of these 
systems was predicted based on bioinformatic analyses of bac- 
terial and archaeal genomes and their genetic elements, which 
predicted that unique DNA sequences (spacers) of CRISPR 
arrays were derived from invasive genetic elements 1 " 3 and it was 
further hypothesized that the defense could occur via an RNA 
interference-like mechanism. 4 Subsequently, in vivo CRISPR 
activity has been studied in different microorganisms demon- 
strating that immune response by CRISPR/Cas systems exerts 
through three distinct stages: 5 " 10 (1) Pieces of DNA sequence of 
invading genetic elements are incorporated into a CRISPR locus 
at a position between the leader and the first spacer, generating 
new spacers; (2) Transcription of CRISPR arrays from their lead- 
ers gives precursor CRISPR RNA molecules (pre-crRNAs) that 
are processed to yield mature CRISPR RNAs (crRNAs) and (3) 
crRNAs and Cascade (CRISPR-associated complex for antiviral 
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defense) form ribonucleoprotein complexes, exerting silencing of 
invasive nucleic acids. 

CRISPR/Cas systems are very diverse and their reevaluation 
has led to the classification of three main classes, type I, II and 
III, which are further divided into subgroups. 7 Recently, Casl and 
Cas2, the only universally conserved Cas proteins, 4,7,11 formed a 
complex with Csal and Cas4 in Thermoproteus tenax, which was 
implicated in acquisition of new spacers, and for this reason they 
were classified as a new Cas module, CRISPR-associated cluster 
for integration of new spacers (Cascis). 12 Following the first dem- 
onstration of spacer acquisition in Streptococcus thermophilus,^ 
insertion of new spacers between the leader sequence and the first 
spacer of CRISPR arrays has been shown in Escherichia coli 14 ' 16 
and Sulfolobus solfztaricus.' 7 Moreover, it has been shown that 
Casl and Cas2 are sufficient to yield spacer insertion in E. coli. H 

Small RNA products generated from CRISPR arrays were first 
detected a decade ago in Archaeoglobus fulgidus ls and later were 
found to be widespread in microorganisms. CRISPR arrays often 
consist of a large cluster of spacer-repeat units. Transcription of an 
entire CRISPR array gives pre-crRNA to be processed into effec- 
tor crRNAs. 19 " 22 Two distinct mechanisms are known to generate 
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Figure 1. S. islandicus Cas modules and construction knockouts lacking a Cas module or a cos gene. (A) Schematic of CRISPR arrays and Cas modules in 
S. islandicus REY15A. (B) PCR verification of mutant alleles of Cas modules and cas genes in the mutants constructed with the MID procedure, described 
previously. 66 PCR was conducted with specific primers for each Cas module or cas gene with genomic DNAs extracted from the genetic host S. islandi- 
cus E233S (wt) and corresponding deletion mutant (A). M, DNA size marker. Cascis, CRISPR-associated cluster for integration of new spacers; IA, type IA 
interference module; Cmr-a and Cmr-p, two type NIB modules containing 6 and 7 cmrgenes, respectively. 47 



crRNAs. The first is endonuclease Cas6, a processing enzyme for 
type I and type III CRISPR systems, 23 " 27 which binds to a repeat 
of a pre-crRNA and cleaves it. In fact, the enzyme remains bound 
to the RNA after cleavage. 23 ' 28 The second is RNase III that does 
the cleavage under the guidance of a trans-acting tracrRNA in 
type II systems. 29 

At the final stage, it is believed that Cascade complexes con- 
taining crRNAs recruit Cas3 helicase/nuclease and exert invader 
silencing, 25,30 " 32 during which a target nucleic acid is recognized by 
complementarity between the protospacer and the corresponding 
effector crRNA. DNA targeting activity has been demonstrated 
for representative systems belonging to each of the main types of 
CRISPR systems, including the E. coli type IE crRNA-Cascade 
ribonucleoprotein complex in vitro, 33 DNA targeting activity of 
type II systems in S. thermophilus in vivo 34 and in vitro 35 and the 
Staphylococcus epidermidis type IIIA system Csm. 36 Furthermore, 
P. furiosus and S, solfataricus type IIIB CRISPR repeat-associated 
mysterious proteins (RAMP), 11 also named Cmr modules, medi- 
ate RNA targeting activity. 32,37,38 



An important point in the antiviral defense is how CRISPR 
systems distinguish self vs. non-self DNA. Most known CRISPR 
systems recognize an invasive genetic element by identifying a 
protospacer-adjacent motif (PAM) located either at 5'- or at 
3'-flanking position to a protospacer of the genetic element. 9 PAM 
motifs were first recognized from genome sequence analyses, 20,39 
and subsequently experimentally demonstrated by virus infection 
in bacteria 40,41 or by interference plasmid assay in archaea. 42 " 44 
The other type of self vs. non-self discrimination involves repeat 
protection of chromosome-encoded CRISPR arrays from the 
DNA interference by the Csm module in S. epidermidis. 45 

All known Sulfolobus strains encode multiple CRISPR interfer- 
ence modules belonging to type IA or type IIIA or IIIB, whereas 
the number and types of modules carried in each strain vary from 
one to another. 46 In the recently completed genome of S. islandicus 
REY15A, 47 there are two arrays of CRISPR elements with the same 
repeat, a Cascis locus with five genes, a type IA CRISPR inter- 
ference system and two distinct type IIIB Cmr systems (Fig. 1). 
This organism is one of the genetically tractable archaeal models 48 
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Figure 2. Cascis, Cmr-a and Cmr-(3 modules are dispensable for pre- 
crRNA processing and PAM motif-dependent DNA silencing. (A and B) 
Northern analyses of total RNAs prepared from the S. islandicus wild- 
type strain (wt) and knockouts of Cas modules using oligonucleotide of 
the repeat (A) or that of spacer 32 of array 2 (B) as a probe. Band of ca. 
60-70 nt corresponds to the primary cleavage products of pre-crRNAs 
by Cas6. Mature crRNAs (ca. 40-60 nt) of A2S32 are also indicated. 
Unfilled arrowhead indicates the location of the smallest crRNA. EtBr, 
RNA staining with ethidium bromide as a loading control. (C) Plasmid 
interference assay of Cas module knockouts. The genetic host E233S 
(wt) and Cas module knockouts were used as hosts for plasmid transfor- 
mation. Two artificial interference plasmids, pCC-A1S6 and pCC-A2S32, 
were used to test for PAM-dependent DNA interference. In the cor- 
responding reference plasmids, the PAM motif (5'-CCA-3') was replaced 
with 5-GAAAG-3' (pRt-A1S6) or 5-ATTGAAAG-3' (pRt-R2S32). Trans- 
formation efficiencies of each interference plasmid were expressed as 
relative values to the efficiencies of corresponding reference plasmids, 
the latter of which were set to 1 .0. 



with which a novel gene knockout method has been established 49 
and plasmid interference assays were developed for testing inter- 
ference activities. 42,50 This has enabled us, for the first time, to 
conduct genetic analyses of cas gene functions in a microbe car- 
rying complex CRISPR/Cas systems. Deletion mutants lacking 
Cascis, Cmr-a or Cmr-(3 module and those deleting each of the 
genes encoding components of the putative I A- Cascade complex 
were constructed and analyzed pre-crRNA processing and CC 



PAM motif-mediated targeting. We show that type IA module 
is responsible for the PAM-dependent interference. Evidence has 
been provided to demonstrate that other Cas proteins also play a 
role in modulating pre-crRNA processing by Cas6. 

Results 

Cascis, Cmr-a and Cmr-(3 modules are dispensable for pre- 
crRNA processing and CC PAM motif DNA targeting. 

Previously, it was shown that CRISPR arrays were transcribed 
and processed into short RNA molecules in different Sulfolobus 
species 20,44,51 and that Sulfolobus CRISPR systems conferred 
DNA targeting activity. 42 However, as Sulfolobus contain mul- 
tiple Cas interference modules, functions of each Cas module 
and individual cas genes in these molecular processes have not 
been studied. Here, we systematically studied possible functions 
of four Cas modules in S. islandicus REY15A, i.e., a type IA 
interference module, a Cas cluster for integration of new spacers 
(Cascis) and two type IHB modules Cmr-a and Cmr-(3 modules 
(Fig. IA). Mutant strains lacking each entire module of Cascis, 
Cmr-a or Cmr-(3 were constructed, from which total RNAs were 
prepared and subsequently analyzed by northern hybridization as 
described in Materials and Methods. 

Results of the northern hybridization of RNAs prepared from 
the wild-type strain were very similar to those of the northern 
analyses of the mutants (Fig. 2A and B). The radiolabeled repeat 
oligonucleotide probe hybridized to RNAs of 60—70 nt in size. 
These RNAs are comparable in size with the mature crRNAs 
produced by an S. solfataricus Cas6, the primary product of pre- 
crRNAs was processed by Cas6, yielding spacers carrying 8 nt 
5'-handle and 16 nt 3'-handle of the repeat. 25 A few processing 
intermediates were also hybridized by the repeat probe showing 
the sizes equivalent to the RNAs carrying 3, 5 and 7 spacers 
(Fig. 2A). When analyzed with an oligonucleotide of array 2 
spacer 32 as probe, an RNA of ca. 62 nt as well as RNAs < 
60 nt were identified (Fig. 2B). The former could represent 
a primary processing product of Cas6 cleavage and the latter 
should be resulted from further processing/maturation of the 
primary product. Again, the probe identified the same RNA 
bands in ACascis, ACmr-a and ACmr-(3, comparing with the 
wild-type strain except for the smallest crRNA species (ca. 40 or 
41 nt) that is absent from ACmr-a (Fig. 2B), a result consistent 
with the functional analysis of Cmr-a in this crenarchaeon. 50 
Furthermore, none of these bands was present in the hybridiza- 
tion of the total RNAs prepared from ACRISPR, a strain lack- 
ing both arrays of CRISPR, indicating that the hybridization 
was specific for both probes. Taken together, this indicated that 
whereas ACmr-a is responsible for the maturation of the small- 
est crRNA, none of these Cas modules play an important role 
in pre-crRNA processing in this archaeon. 

These strains were then tested for DNA targeting mediated 
by CC PAM motif using two different interference plasmids, 
pCC-A2S32 and pCC-AlS6. These plasmids carry the artificial 
protospacer sequence of array 2 spacer 32 (A2S32) or that of 
array 1 spacer 6 (A1S6) where each spacer is preceded by the 
PAM motif (5'-CCA-3') of Sulfolobus, , 20 In the corresponding 
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Figure 3. Functions of Cas proteins of type IA interference module in pre-crRNA processing and PAM-dependent DNA silencing. (A and B) Northern 
analyses of total RNAs prepared from the 5. islandicus wild-type strain (wt), knockouts (A) of each cas gene of type IA interference module and their 
complementation strains (C) using oligonucleotide of the repeat (A) or that of spacer 32 of array 2 (B) as a probe. Band of ca. 60-70 nt corresponds to 
the primary cleavage products of pre-crRNAs by Cas6. Mature crRNAs (ca. 40-60 nt) of A2S32 are also indicated. Unfilled arrowhead indicates the loca- 
tion of the smallest crRNA. EtBr, RNA staining with ethidium bromide as a loading control. (C) Plasmid interference assay of knockout mutants using 
pCC-A2S32 and pCC-A1S6 as interference plasmids and pRt-A2S32 and pRt-A1S6 as reference plasmids. Transformation efficiencies of each interference 
plasmid were expressed as relative values to the efficiencies of corresponding reference plasmids, the latter of which were set to 1.0. 



reference plasmids, each PAM motif was replaced with 5 or 
8 nt of the repeat sequence, yielding a structure resembling the 
genetic organization of CRISPR arrays in the chromosome. As 
shown in Figure 2C, when each mutant was employed as host 
for transformation, hundreds to thousands fold lower rates of 
transformation were yielded with pCC-A2S32 and pCC-AlS6 
comparing with their reference plasmids, indicating none of 
these modules is essential for the CC motif-mediated DNA tar- 
geting activity. 



Functions of Cas6, Cas7, Cas5 and Csa5, components of 
the aCASCADE complex in S. islandicus. Recently, an archaeal 
Cascade complex was isolated from S. solfataricus. 25 It con- 
tained Cas7, Cas5, Cas6 and Csa5 and was loaded with crRNAs 
of 60-70 nt, which correspond to primary cleavage products 
by Cas6. Each of these Cascade proteins has a homolog in S. 
islandicus REY15A showing 82-93% amino acid sequence iden- 
tity. Since both S. solfataricus and S. islandicus exhibited PAM- 
dependent DNA targeting activity, we were interested in studying 
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possible functions of these Cas proteins in this process. In-frame 
deletion mutants were constructed for all four cas genes, giving 
Acas7, Acas5, Acsa5 and Acas6. Their complementation plas- 
mids were constructed with a highly efficient expression vector 
pSeSD 52 and transformed into corresponding mutants to yield 
complementation strains. Both mutants and complementation 
strains were used for northern analysis of pre-crRNA processing 
and for the interference plasmid assay. 

First, we examined pre-crRNA processing in Acas6 and its 
complementation strain. As shown in Figure 3, depleting Cas6 
completely inactivated pre-crRNA processing since neither inter- 
mediates nor mature crRNAs were present in the mutant strain. 
These bands appeared again in the complementation strain where 
the Cas6 protein was expressed from the expression plasmid pSe- 
cas6. These results indicated that cas6is essential for crRNA pro- 
duction in S. islandicus and that it is the only enzyme responsible 
for pre-crRNA processing in this organism. 

Next, we analyzed processing intermediates and crRNAs 
in Acsa5, Acas5 and Acas7 from which different results were 
obtained. Northern analysis of Acsa5 and its complementation 
strain showed the same hybridization pattern and RNA band 
intensities that were essentially identical to the results of the 
wild-type strain. Hybridization signals of processing intermedi- 
ates were absent or greatly weakened in Acas7, while crRNAs 
were maintained at very similar levels. The exactly opposite 
results were obtained for Acas5 where mature crRNAs were 
hardly detectable but normal levels of processing intermediates 
accumulated. Furthermore, genetic complementation of mutant 
deficiency by episomic expression of each corresponding gene 
from a plasmid restored the hybridization pattern of the wild- 
type strain (Fig. 3). These results indicated that Cas7 is essen- 
tial for stabilizing RNA intermediates of pre-crRNA processing 
and it could function in the absence of Csa5 or Cas5, and that 
Cas5 is essential for maintaining mature crRNAs whereas Csa5 
does not appear to have a role in pre-crRNA processing and 
maturation. 

These mutants were also tested for PAM motif-mediated DNA 
targeting. As shown in Figure 3C, whereas Acas7 and Acas5 lost 
the capability of conferring DNA interference to pCC-A2S32 
and pCC-AlS6, depleting Csa5 did not compromise the infer- 
ence activity. 

Taken together, we have demonstrated that only three of the 
four putative Cascade proteins, i.e., Cas6, Cas7 and Cas5 are 
essential for the PAM-mediated DNA interference. All these Cas 
proteins function in pre-crRNA processing in which Cas6 is the 
primary processing enzyme in vivo whereas Cas7 and Cas5 stabi- 
lize processing intermediates and crRNAs, respectively. However, 
Csa5 does not appear to have a role in either process, suggesting 
that Csa5 may not represent an integral part of the Cascade that 
mediates PAM-directed DNA silencing. 

Cas3 helicase/nuclease functions both in processing and 
targeting. Cas3 is required for all known type I CRISPR sys- 
tems for interference. In some CRISPR systems such as those in 
Sulfolobus species, Cas3 is divided into two parts: Cas3' contain- 
ing a helicase domain and Cas3" carrying a nuclease domain. To 
reveal their role in DNA interference, Acas3' and Acas3" were 



investigated for silencing of pCC-A2S32 and pCC-AlS6 inter- 
ference plasmids. As shown in Figure 3C, both mutants showed 
high transformation efficiencies with all tested plasmids, includ- 
ing both interference and control plasmids, indicating that the 
interference activity was inactivated in the mutants. 

When pre-crRNA processing was studied by northern 
analysis, both mutants showed increased levels of processing 
intermediates compared with the wild-type strain and a much 
elevated level of mature crRNA was observed for Acas3" (Fig. 3). 
Interestingly, while genetic complementation of the mutant defi- 
ciency by episomic expression of cas3' or cas3" reversed the strong 
accumulation of the intermediates, a few new RNA species (60 < 
crRNAs < 80 nt), which first appeared in Acas3' and Acas3", per- 
sisted in their complementing strains. These results suggest that 
the helicase and nuclease activities of Cas3 function in resolving 
the processing intermediates and that the nuclease domain could 
play a role in crRNA degradation. 

Discussion 

In this work, we have studied functions of different interference 
modules in pre-crRNA processing and PAM-mediated DNA 
targeting in S. islandicus REY15A. Mutants lacking each mod- 
ule were constructed and analyzed for mature crRNA and their 
processing intermediates and for DNA interference. This has 
revealed that the type IA interference module mediates PAM- 
dependent DNA targeting and it functions independently from 
other Cas/Cmr modules that co-exist in S. islandicus REY15A. 
We also show that depleting Cas6 inactivates pre-crRNA pro- 
cessing and that inactivation of Cas7, Cas5, Cas3' or Cas3" 
impairs processing of pre-crRNAs. This unravels an important 
difference between in vivo and in vitro results since biochemical 
characterization of a S. solfataricus Cas6 protein, which is 93% 
identical to this S. islandicus Cas6 in amino acid sequence, dem- 
onstrated that Cas6 was capable of generating mature crRNAs 
by itself. 25 

A Cascade complex (I A- Cascade) isolated from S. solfataricus 
was reported consisting of at least four components, Cas7, Cas5, 
Csa5 and Cas6. 25 Characterization of other type I Cascade com- 
plexes, including the IE-Cascade of E. coli and the IF Cascade of 
Pseudomonas aeruginosa, revealed five components (Csel, Cse2, 
Cas7, Cas5 and Cas6e) for type IE 19,30 and four components 
Csyl, Csy2, Csy3 and Cas6f (Csy4) for type IF. 53 Importantly, 
all Cascade components and Cas3 are shown to be essential for 
the DNA-targeting activity in IE and IF systems. 1 ' ; ' 30,54 Here, we 
show that Csa5, one of the Cascade components, is dispensable 
for the PAM-directed DNA targeting in S. islandicus. It is impor- 
tant to point out though that a functional S. solfataricus Cascade 
complex remains to be demonstrated since the isolated one did 
not show any cleavage activity. 25 

Interestingly, although non-essential to DNA interfer- 
ence, Csa5 is encoded in all known archaea carrying a type IA 
CRISPR system, including all known Sulfolobus species, 47,55 " 58 
Methanocaldococcus jannaschii^ Thermoproteus tenax n and 
four Pyrobaculum species, 60 whereas Csa5 is absent from all 
known type IB CRISPR systems, including Haloferax volcanii^ 
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Clostridium tbermocellum and Methanococcus maripaludis. 27 
Currently, the role of Csa5 in type 1A CRISPR system remains 
to be demonstrated. 

In the E. coli type IE system, cleavage of pre-crRNAs by Cas6e 
yields mature crRNAs that bind to IE-Cascade to form ribonu- 
cleoprotein complexes. 33 It has been shown that Csel component 
of the IE-Cascade recognizes the PAM motif of a protospacer 
of invasive DNAs 61 and complementarity between the crRNA 
and the protospacer then guides the cleavage of foreign DNA by 
Cas3. 31 Currently, the counterpart of Csel remains to be identi- 
fied in the Sulfolobus IA-Cascade complex if this complex func- 
tions in analogy to the E. coli IE-Cascade. 

Examination of crRNA processing in S. islandicus by northern 
hybridization showed that the processing was inactivated by Cas6 
depletion and the inactivation only occurred in Acas6 (Fig. 3). 
This is consistent with the results that P. furiosus and S. solfatari- 
cus Cas6 proteins are capable of generating crRNA in vitro. 25,62 
Together, these studies demonstrate that Cas6 is both necessary 
and sufficient for pre-crRNA processing in these archaea. 

Furthermore, our study of pre-crRNA processing in the cas 
gene knockouts of S. islandicus has revealed three interesting fea- 
tures: (1) mature crRNAs are stabilized in Acas7, (2) processing 
intermediates are stabilized in Acas5 and (3) processing interme- 
diates are accumulated in Acas3' and Acas3". Interestingly, Cas7 
forms protein filaments in S. solfataricus. 2 '' Putting all these facts 
together, possible mechanistic details are revealed for pre-crRNA 
processing in S. islandicus, (1) Immediately after transcription, 
pre-crRNAs are bound by Cas7 protein filaments, protecting the 
large RNA molecules from degradation; (2) Cas6 proteins bind 
to repeat sequences of pre-crRNAs and make a cut at each repeat, 
(3) as both Cas7 and Cas6 proteins remain bound to the pro- 
cessing products, interaction between Cas3' and Cas3" and the 
complex will facilitate the remodeling of new smaller complexes 
and (4) single spacer unit complexes can only be stabilized by 
recruiting Cas5, which either constitute the Cascade or a basis for 
generating Cascade complexes. Possibly, Cas5 may also modulate 
Cas3 activities that could lead to crRNAs degradation. 

The proposed strategy is apparently not used in the type I sys- 
tems of E. coli and P. aeruginosa because relative band intensities 
of intermediates and crRNAs did not change upon depletion of 
any component of the Cascade complexes in these organisms. 19 ' 54 
Neither has it been reported that the E. coli Cas7 forms protein 
filaments. In fact, these type I systems are further diverged, 
whereas only Cas6 is essential in pre-crRNA processing in S. 
islandicus, two protein factors are reported to be essential for 
in vivo pre-crRNA processing in the bacterial systems, namely 
Cas5 and Cas6 in E. colP ,6i and Cas6f (Csy4) and Csy2 in P. 
aeruginosa.^ 

Then, the question is why it is relevant for S. islandicus to 
employ such a strategy in crRNA processing. This archaeon 
contains three distinct interference modules that are presumed 
to employ different types of mature crRNA for their interfer- 
ence. The type IA system most likely utilizes 60—70 nt crRNAs, 
which are components of the S. solfataricus Cascade complex. 25 
The Cmr-|3 module most likely uses 30—50 nt crRNAs, which 
are associated with the S. solfataricus type IIIB Cmr reported by 



Zhang et al., 38 because the two Cmr systems share high-sequence 
similarity and exhibit the same gene synteny. Although the 
Cmr-a complex has not been isolated thus far, here we show that 
depleting the complex eliminates the smallest crRNA, which is ca. 
40—41 nt crRNA for A2S32. More recently, interference activity 
has been demonstrated for the S. islandicus Cmr-ct, which confers 
a protospacer transcript-dependent DNA interference. 50 Further 
investigation of the distinct Cas interference modules in S. islan- 
dicus will yield important insights into regulation of pre-crRNA 
processing and maturation as well as mechanisms of coordinat- 
ing different interference activities that co-exist intracellularly. 

Materials and Methods 

Sulfolobus strains, growth conditions and transformation. All 

experiments were performed with the model organism of Sulfolobus 
islandicus REY15A originally isolated from an enrichment culture 
obtained from samples collected from hotsprings in Iceland. 64 Its 
genetic host E233S is double deletion mutant carrying a large dele- 
tion in pyrEF genes and a complete deletion of lacS gene. 65 

All Sulfolobus strains used in this study are listed in Table 1. 
The strains that carry pyrEF genes either on the chromosome 
or on a plasmid were grown at 76°C in the SCV (0.2% sucrose, 
0.2% casamino acids plus 1% vitamin solution) selective 
medium, whereas the medium was supplemented with 20 (Jtg/ml 
uracil (SCVU) when growing^j/r^F-deficient strains. Sulfolobus- 
competent cells were prepared as described previously 65 and 
plasmids were introduced into the cells by electroporation and 
transformed cells were allowed to form colonies on SCV plates. 

Construction of deletion mutants of S. islandicus. A novel 
gene knockout method named marker insertion and unmarked 
target gene deletion (MID) developed by Zhang et al. 66 was 
employed to construct cas gene knockouts. In MID manipula- 
tion, each plasmid vector contains three homologous sequence 
arms, i.e., left, right and target gene arms. These homologous 
arms were carefully designed such that the insertion of a marker 
cassette would yield a functional target gene at the first step of 
recombination. Primers for amplification of these arms for each 
Cas module or cas gene by polymerase chain reaction are listed in 
Table 3. Ligation of these arms to the E. coli vector pUC19 gave 
pMID-Cascis, -Cmr-a, -Cmr-(3, -cas7, -cas5, -cas6, -csa5, -cas3' 
and -cas3". These knockout plasmids were linearized with Pvu 
II before transformation. Transformants were obtained on SCV 
plates, whereas mutant cells were recovered by counter selection 
of pyrEF with 5'-fluoroorotic acids (5-FOA). After three rounds 
of single colony purification, the mutants were analyzed for their 
mutant alleles by PCR with specific primers for each target gene 
or gene region and their mutant genotypes were verified (Fig. IB). 

Complementation plasmids were constructed with pSeSD, a 
highly efficient expression vector recently reported for Sulfolobus? 2 
The expression is under the control of P araS _ SD , a synthetic pro- 
moter that confers up to one-third of the fully induced level of 
expression in a sucrose medium. 52 Each cas gene was amplified by 
PCR from S. islandicus total DNA with a Pfu DNA polymerase 
using specific primers listed in Table 3. After cleavage with the 
enzymes indicated in each PCR primer, the DNA fragments were 
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Table 1. Strains used in this study 



Strain 


Genotype and features 


Source 


S. islandicus E233S 


ApyrEFA/acS 


Deng et al. 65 


ACRISPR 


CRISPR arrays are not detectable by PCR 


Gudbergsdottiretal. 42 


ACascis 


All genes in Cascis module, including csa3, csal, cast, cas2 and cas4 are deleted 


This work 


ACmr-ct 


Deletion of the type NIB Cmr locus with 6 genes 


This work 


ACmr-p 


Deletion of the type NIB Cmr locus with 7 genes 


This work 


Acsa5 


In-frame deletion of csa5 


This work 


Acas7 


In-frame deletion of cas7 


This work 


Acas5 


In-frame deletion of cas5 


This work 


Acas3' 


In-frame deletion of cas3' 


This work 


Acas3" 


In-frame deletion of cas3" 


This work 


Acas6 


Carrying deletion of the entire cas6 gene plus 277 bp downstream flanking sequence 


This work 



Table 2. Plasmids used in this study 



Plasmid 


Features 


Source 


pSeSD 


A Sulfolobus-E. coli shuttle vector carrying an expression cassette controlled under a synthetic strong promoter P 


Peng et al. 52 


pCC-A1S6 


An artificial interference plasmid; pSeSD carrying the interference module of the sequence of spacer 6 of array 1 preceded 

by the 5-CCA-3' motif 


This work 


pRt-A1S6 


Reference plasmid of pCC-A1S6; the PAM motif (5'-CCA-3') was replaced with the last 5 nt of the repeat sequence (5'-GAA 

AG-3') 


This work 


pCC-A2S32 


An artificial interference plasmid; pSeSD carrying the interference module of the sequence of spacer 32 of array 2 pre- 
ceded by the 5'-CCA-3' motif 


This work 


pRt-A2S32 


Reference plasmid of pCC-A2S32; the PAM motif (5'-CCA-3') was replaced with the last 8 nt of the repeat sequence (5'-ATT 

GAA AG-3') 


This work 


pSe-Csa5 


pSeSD carrying the S. islandicus csa5 gene 


This work 


pSe-Cas7 


pSeSD carrying the S. islandicus cas7 gene 


This work 


pSe-Cas5 


pSeSD carrying the S. islandicus cas5 gene 


This work 


pSe-Cas3' 


pSeSD carrying the S. islandicus cas3' gene 


This work 


pSe-Cas3" 


pSeSD carrying the S. islandicus cas3" gene 


This work 



pSe-Cas6 pSeSD carrying the 5. islandicus cas6 gene This work 



cloned to pSeSD at the compatible sites, yielding pSe-csa5, -cas5, 
-cas7, -cas3', -cas3" and -cas6. 

Plasmid interference assay of PAM-mediated DNA tar- 
geting. Artificial interference plasmids were constructed with 
two spacers, spacer 6 of CRISPR array 1 (A1S6) and spacer 32 
of array 2 (A2S32). Oligonucleotides containing the complete 
sequences of the two spacers that were preceded with 5'-CCA-3' 
or with the last five or eight nucleotides of the repeat were syn- 
thesized from TAG Copenhagen (Table 3). Each pair of prim- 
ers was annealed with each other by heating to 95°C for 10 min 
and then cooling down gradually to room temperature, yielding 
double-stranded DNAs with one blunt end and one protruding 
end, the latter of which matched the DNA ends after Mlu I 
digestion. These DNA fragments were cloned to pSeSD at Stu 
I and Mlu I sites individually, yielding plasmids containing an 
artificial protospacer (interference plasmids) and those contain- 
ing a repeat-spacer module (reference plasmids) (Table 2). 



RNA preparation and northern analysis. Total RNAs were 
extracted using the Trizol reagent (Invitrogen) following the 
protocol provided by the manufacturer. Thirty |JLg RNA was 
loaded on a 6% polyacrylamide-SDS gel for each sample and 
fractionated. RNAs in the gel were then transferred onto a nylon 
membrane and immobilized. Procedures for pre-hybridization, 
end labeling of complementary nucleotides and hybridization 
in northern analysis were described earlier 20 and the probes 
used in this work included a probe of the repeat sequence (5- 
CTT TCA ATT CTA TAG TAG ATT AGC NNN N-3') for 
detecting intermediates of pre-crRNA processing and a probe 
of spacer 32 of array 2 (5'-GCG GGA TTA GTA GGG ATT 
CCC ATA GGG CTC TAT GAA CT-3') hybridizing to mature 
crRNAs. 
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Table 3. Oligonucleotides used in this study 


Oligonucleotide 


Sequence (5"-3"; PAM motif or repeat sequence underlined, restriction site in bold) 




A1S6fwd-Mlul 


CGC GTCCAG TTG GCA TAA TCC TCA ACA TAC TTT CTT GTG TCG GGT ATG C 




A1S6rev 


GCA TAC CCG ACA CAA GAA AGT ATG TTG AGG ATT ATG CCA ACT GGA 




ReA1S6fwd-Mlul 


CGC GAA AGG TTG GCA TAA TCC TCA ACA TAC TTT CTT GTG TCG GGT ATG C 




ReA1S6rev 


GCA TAC CCG ACA CAA GAA AGT ATG TTG AGG ATT ATG CCA ACC TTT (C) 




A2S32fwd-Mlul 


CGC GTC CCA AGT TCA TAG AGC CCT ATG GGA ATC CCT ACT AAT CCC GC 




A2S32rev 


GCG GGA TTA GTA GGG ATT CCC ATA GGG CTC TAT GAA CTTGGG A 




ReA2S32fwd-Mlul 


CGC GAT TGA AAG AGT TCA TAG AGC CCT ATG GGA ATC CCT ACT AAT CCC GC 




ReA2S32rev 


GCG GGA TTA GTA GGG ATT CCC ATA GGG CTC TAT GAA CTC TTT CAA T 




Csa5fwd-Mlul 


CCA CGC GTT GGC ACA ACA AGT AAA AGA AG 




Csa5rev-Stul 


AAG GCC TTT TCT TCT CAC CAC CAC CTT G 




Cas7fwd-Ndel 


GTT GTT CCA TAT GAT AAG TGG TTC AGG TAG ATT T 




Cas7rev-Mlul 


CGA CGC GTC TAC TTT TCC TTT AAT TTA CC 




Cas5fwd-Mlul 


GGA CGC GTT GAT CTA CTC TAA GGT TTT TT 




Cas5rev-Stul 


GAG GCC TGC TAA AGA CAA CAT ATT CTC C 




Cas3'fwd-Ndel 


CCC AAA GCA TAT GTT GTC TTT AGC TGA CTT CTA T 




Cas3'rev-Mlul 


CGA CGC GTT CAA TAC ACA CCA CCT ATT TC 




Cas3"fwd-Ndel 


AAT AAA TCA TAT GAT CAA GCC TTG CGC TTA TGA G 




Cas"rev-Mlul 


CGA CGC GTT TAT AAA GTG GAA CCT CCA TT 




Cas6fwd-Ndel 


GGG ACC GCA TAT GCC ATT AAT TTT CAA GAT A 




Cas6rev-Mlul 


GTA CGC GTA CCT TTA AAG TCT GAG GA 




Cmr-a-Gfwd-Sall 


AAT TGT CGA CGC AGG AAT GGT GGT AGA GTC T 




Cmr-a-Grev-Mlul 


CCA CGC GTC CTC CCC ACT TGA ATA CTA CC 




Cmr-a-Lfwd-Ncol 


AAT TCC ATG GCA GGA AAG CAG TAG AGA AGG A 




Cmr-a-Lrev-Xhol 


ATC CTC GAG AAC TTG TAA CCC TAC GTC GTT 




Cmr-a-Rfwd-Xhol 


ATC CTC GAG ACT AAG ATG ACT AAG GAA GGC 




Cmr-a-Rrev-Sphl 


AAT TGC ATG CAC AAA AAC ATG AAC GTA TGA G 




Cmr-p-Gfwd-Sall 


AAT TGT CGA CAT AGT CTT CAT CGG CAC ATA C 




Cmr-p-Grev-Mlul 


CCA CGC GTT CAT CTG TAG AGG AGG AAA TA 




Cmr-p-Lfwd-Ncol 


AAT TCC ATG GTC AGC AGG CAG TAA AGG AAG A 




Cmr-p-Lrev-Xhol 


ATC CTC GAG CAA TGG GTA AGG TGG ATA AGA 




Cmr-p-Rfwd-Xhol 


ATC CTC GAG TTG CAT CGG GAC TAC TGT AAG 




Cmr-(3-Rrev-Sphl 


AAT TGC ATG CAG AGG GTG GGT AAG TGA TGA T 




Cassis-Gfwd-Sall 


AAT TGT CGA CAT GTT CTC CTT CAT GCC CGT T 




Cascis-Grev-Mlul 


CCA CGC GTG TCC GTT CTA ATA CCG CTC CT 




Cascis-Lfwd-Ncol 


AAT TCC ATG GTT CAT CAA GAG GAA AAT CGT C 




Cascis-Lrev-Xhol 


ATC CTC GAG TTC GTT AGA AAC ACT CGC TAG 




Cascis-Rfwd-Xhol 


ATC CTC GAG TGT ATT CAT AAG CCT CAT TCC 




Cascis-Rrev-Sphl 


AAT TGC ATG CCA ACG GAA ATA GTA GGG AAC A 




Cas3'-Gfwd-Sall 


AAT TGT CGA CAA GGT AAC GAA ACT AAA GAC T 




Cas3'-Grev-Mlul 


CCA CGC GTC ACA CCA CCT ATT TCA CTA TT 




Cas3'-Lfwd-Ncol 


AAT TCC ATG GGC TTA CAC TGA GGA CAT TGT T 




Cas3'-Lrev-Xhol 


ATC CTC GAG ACC ATG ACG AGT TAT GAA ATC 




Cas3'-Rfwd-Xhol 


ATC CTC GAG TGA TCA AGC CTT GCG CTT ATG 




Cas3'-Rrev-Sphl 


A AT — T/"/~ (IT/* ^TA Tf~f~ (~t~t TA A 1 — Vf (~t — T Tl~ A t~ 

A A 1 1 GC ATG C 1 A 1 LL LLC 1 A A 1 1 G CG 1 1 L A G 




Cas3"-Gfwd-Sall 


AAT TGT CGA CGG TGA TTG ATT CGA TTC TGG C 




Cas3"-Grev-Mlul 


CCA CGC GTA GTG GAA CCT CCA TTT AAC TC 
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Cas3"-l_fwd-Ncol 


AAT TCC ATG GGG TGA TTG ATT CGA TTC TGG C 






Cas3"-Lrev-Xhol 


ATC CTC GAG CTC ATA AGC GCA AGG CTT GAT 






Cas3"-Rfwd-Xhol 


ATC CTC GAG TTA TAA TAT ATT TGG AGA TAA 






Cas3"-Rrev-Sphl 


AAT TGC ATG CTG AAT CAA ACT GTA GAG AAT A 






Cas5-Gfwd-Sall 


AAT TGT CGA CAG TGA TCC CTA CTG GTA AAG T 






Cas5-Grev-Mlul 


CCA CGC GTA GAA GTC AGC TAA AGA CAA CA 






Cas5-Lfwd-Ncol 


AAT TCC ATG GGT TAG AAA GGC AAA AGG CTC G 






Cas5-Lrev-Xhol 


ATC CTC GAG TAA CTA CGG AAA AAC CCC AAT 






Cas5-Rfwd-Xhol 


ATC CTC GAG TTA GCT GAC TTC TAT AAC GAT 






Cas5-Rrev-Sphl 


AAT TGC ATG CGT CTT GAG CAC CAA TAC TTT T 






Cas6-Gfwd-Sall 


AAT TGT CGA CTT GCC ATC AGT TGG CTT AAT T 






Cas6-Grev-Mlul 


CCA CGC GTC CTT AAC CTT TAA AGT CTG AG 






Cas6-Lfwd-Ncol 


AAT TCC ATG GGT TAG AAA GGC AAA AGG CTC G 






Cas6-l_rev-Xhol 


ATC CTC GAG TGA AAC CGA GAT GGG ATA TAA 






Cas6-Rfwd-Xhol 


ATC CTC GAG CCT CAG ACT TTA AAG GTT AAG 






Cas6-Rrev-Sphl 


AAT TGC ATG CGG AGG GGA GAT GTT CCG ATG C 






Cas7-Gfwd-Sall 


AAT TGT CGA CAA GCC TTA GCT CAT GCC TAT 






Cas7-Grev-Mlul 


CCA CGC GTT CTT GCC CCT CTA CTT TTC CT 






Cas7-Lfwd-Ncol 


AAT TCC ATG GTC CAG GTT TAT GAA AAA TCT T 






Cas7-Lrev-Xhol 


ATC CTC GAG ATC ATT TCT TCT CAC CAC CAC 






Cas7-Rfwd-Xhol 


ATC CTC GAG GTA GAG GGG CAA GAG AGA TTT 






Cas7-Rrev-Sphl 


AAT TGC ATG CCT CAA CTT CAA CCT CCT TGC T 






Csa5-Gfwd-Sall 


AAT TGT CGA CGA GGC TGT GAC TAA GGT TCT A 






Csa5-Grev-Mlul 


CCA CGC GTT ATC ATT TCT TCT CAC CAC CA 






Csa5-Lfwd-Ncol 


AAT TCC ATG GGG AGG GAT GAG ATT AAT GAT T 






Csa5-Lrev-Xhol 


ATC CTC GAG CTT CTT TTA CTT GTT GTG CCA 






Csa5-Rfwd-Xhol 


ATC CTC GAG AGA AAT GAT AAG TGG TTC AGG 






Csa5-Rrev-Sphl 


AAT TGC ATG CTA CAT GAA ACC TCC AAT ATC A 
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